Handwritten Word Recognition Using MLP based Classifier: A Holistic Approach

Authors

  • Ankush Acharyya
  • Sandip Rakshit
  • Ram Sarkar
  • Subhadip Basu
  • Mita Nasipuri
Abstract

Holistic word recognition is one of the newer modalities for handwritten word identification. The holistic paradigm treats the word as a single, indivisible entity and attempts to recognize it from its overall shape, rather than recognizing the individual characters that make up the word. The present work reports a longest-run based holistic feature that has been used to classify word images belonging to different classes with a neural network based classifier. To evaluate the technique, a few words from the handwritten documents of the CMATERdb1.2.1 dataset have been used. Frequently occurring English words are manually extracted from the handwritten pages, and the accuracy of the technique is evaluated using three-fold cross-validation. The best-case and average-case accuracies of the technique on this data set are 89.9% and 83.24% respectively.
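As an illustration of what a longest-run feature can look like, here is a minimal sketch. It assumes a binarized word image stored as a list of rows of 0/1 values, and it computes only row-wise and column-wise values normalized by image area. The function names, the choice of directions, and the normalization are assumptions made for this example; the abstract does not specify the exact formulation used by the authors.

```python
def longest_run(seq):
    """Length of the longest run of consecutive foreground (1) pixels."""
    best = cur = 0
    for v in seq:
        cur = cur + 1 if v else 0  # extend the current run or reset it
        best = max(best, cur)
    return best


def longest_run_features(img):
    """Row-wise and column-wise longest-run sums, normalized by image area.

    img is a non-empty list of equal-length rows of 0/1 values
    (an assumed representation, not taken from the paper).
    """
    height, width = len(img), len(img[0])
    row_sum = sum(longest_run(row) for row in img)
    col_sum = sum(longest_run(col) for col in zip(*img))
    area = height * width
    return [row_sum / area, col_sum / area]


# Toy 3x4 binary image, purely illustrative.
toy = [
    [0, 1, 1, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 0],
]
features = longest_run_features(toy)
```

A feature vector of this kind would then be passed to a classifier such as an MLP; in practice the image is often divided into sub-regions so that the vector captures local shape as well.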

Related resources

A Holistic Approach for Handwritten Hindi Word Recognition

Holistic word recognition attempts to recognize the entire word image as a single pattern. In general, it performs better than segmentation-based word recognition models for a known, fixed and small-sized lexicon. The present work deals with the holistic recognition of handwritten words in Hindi. Features like area, aspect ratio, density, pixel ratio, longest run, centroid and projection length...

Holistic Farsi handwritten word recognition using gradient features

In this paper we address the issue of recognizing Farsi handwritten words. Two types of gradient features are extracted from a sliding vertical stripe which sweeps across a word image. These are directional and intensity gradient features. The feature vector extracted from each stripe is then coded using the Self Organizing Map (SOM). In this method each word is modeled using the discrete Hidde...

Word level Script Identification from Bangla and Devanagri Handwritten Texts mixed with Roman Script

India is a multi-lingual country where Roman script is often used alongside different Indic scripts in a text document. To develop a script specific handwritten Optical Character Recognition (OCR) system, it is therefore necessary to identify the scripts of handwritten text correctly. In this paper, we present a system, which automatically separates the scripts of handwritten words from a docum...

A Harmony Search Based Wrapper Feature Selection Method for Holistic Bangla word Recognition

Many search approaches have been explored for feature selection in the pattern classification domain, with the aim of finding a significant subset of features that produces better accuracy. In this paper, we introduce a Harmony Search (HS) algorithm based feature selection method for feature dimensionality reduction in the handwritten Bangla word recognition problem. This algorithm has been...

Deep-Belief-Network based Rescoring for Handwritten Word Recognition

This paper presents a novel verification approach towards improvement of handwriting recognition systems using a word hypotheses rescoring scheme by Deep Belief Networks (DBNs). A recurrent neural network based sequential text recognition system is used at first to provide the N-best recognition hypotheses of word images. Word hypotheses are aligned with the word image to obtain the character b...

Publication date: 2013